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1. Introduction. 

Consider a real random variable (rv) Y and a sequence of strictly stationary 
rv's {Yi)i>i with common unknown absolutely continuous disribution func- 
tion (df) F. In survival analysis, the rv's may be the lifetimes of patients 
under study. Let (Ci)j>i, be a sequence of censoring rv's with common un- 
known df G. In contrast to statistics for complete data studies, censored 
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model involves pairs (Tj, ii)i=\^...^n where only Tj = Yil\Ci and bi = 1^Yi<Ci} 
are observed. 

Let X be an IR'^-valued random vector. Let (Xj)j>i be a sequence of copies of 
the random vector X and denote by Xj^i, • • • , X-i^d the coordinates of Xi. The 
study we perform below is then on the set of observations (Tj, 5j, Xj)j>i. In 
regression analysis one expectes to identify, if any, the relationship between 
the YiS and Xj's. This means looking for a function m*{X) describing this 
relationship that realizes the minimum of the mean squared error criterion. 
It is well known that this minimum is achieved by the regression function 
m{x) defined on IR'' by 

m{x)= W.{Y\X = x). 

There is a wide range of literature on nonparametric estimation of the regres- 
sion function and many nonlinear smoothers including kernel, spline, local 
polynomial, orthogonal methods and so on. For an overview on methods and 
results for both theoretical and application points of view considering inde- 
pendent or dependent case, we refer the reader to Collomb [13], Silverman 
03], Hardle Wahba Wand and Jones [47], Masry and Fan Cai 
[8] and Cai and Ould-Said [9]. 

In the uncensored case, the behavior of nonparametric estimators built upon 
mixing sequences is extensively studied. The consistency has been investi- 
gated by many authors. Without exhaustivity we quote Robinson [39} HQ]. 
Collomb [13], Roussas [42] and Laib [29j. Some other types of dependence 
structure have been considered. We refer to Yakowitz [l9] for Markov chains, 
Delecroix [18], Laib and Ould-Said [30] for ergodic processes. Hall and Hart 
|24j for long-range memory processes, Cai and Roussas pjj for associated 
random variables. Collomb and Hardle [15] obtained the uniform conver- 
gence with rates and some other asymptotic results for a family of kernel 
robust estimators under a 93-mixing condition, whereas Gonzalez-Manteiga 
et al. [22j developed a nonparametric test, based on kernel smoothers, to 
decide whether some covariates could be suppressed in a multidimensional 
nonparametric regression study. Under the a-mixing condition, the uniform 
strong convergence of the Nadaraya- Watson estimator is treated in Doukhan 
[19j . Bosq ^ and Liebscher ^34i- Roussas ^42j established the consistency 
with rate of the regression estimator under some summability requirement. 
Techniques used in the estimation of nonparametric regression are closely 
related to density estimation; in this case, kernel estimators have been ex- 
tensively studied: See for example Roussas [H], Tran ^44J, Vieu [46] and 
Liebscher [331 [35]. Cai and Roussas [TO] established the strong convergence 
of the kernel density with rate in the muldimensional case, while Tae and 
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Cox [33] established the same result with a slight difference on the rate. 
Andrews [T] provides a comprehensive set of results concerning the uniform 
almost sure convergence. Masry [36j derived sharp rates for the same kind 
of convergence, but confined attention to the case of bounded regression. 
Our goal is to establish the strong uniform convergence with rate for the 
kernel regression estimate under a-mixing condition in random censorship 
models. For this kind of model, Cai [HE] established the asymptotic proper- 
ties of the Kaplan- Meier estimator. The strong convergence of a hazard rate 
estimator was examined by Lecoutre and Ould-Sai'd [3T] while Liebscher [35] 
derive a rate uniform for the strong convergence of kernel density and hazard 
rate estimators. His result represents an improvement of that given in Cai 
[6]. The consistency results concerning the nonparametric estimates of the 
conditional survival function introduced by Beran [2], Dabrowska [16' 17] 
in the iid case, were extended by Lecoutre and Ould-Sai'd [32j to the strong 
mixing case. We point out that, for the independent case, the behavior of 
the regression function under censorship model has been extensively stud- 
ied. We can quote Carbonez et al. [I2], Kohler et al. [28] and Guessoum 
and Ould-Sai'd [23j. However, few papers deal with the regression function 
under censoring in the dependent case. 

To this end, we were interested in extending the result of Guessoum and 
Ould-Sa'i'd [23] from the iid to the dependent case. The paper is organized as 
follows: In Section 2 we give some definitions and notations under the cen- 
sorship model of the regression function and strong-mixing process. Section 
3 is devoted to the assumptions and main result. In Section 4, some simu- 
lations are drawn to lend further support to our theoretical results. Proof 
with auxiliary results are relegated to Section 5. 

2. Definition of estimators 

Suppose that {Yi,i > 1} and {Ci,i > 1} are two independent sequences of 
stationary random variables. We want to estimate m[x)= Wj(Y \X = x) 



with y) being the joint density of (X, Y) and the density function 

of the covariates. 

Now, it is well known that the kernel estimator of the regression function 
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m(-) under censorship model (see, eg Carbonez et al. [12j) is given by 

n r rp 

fh^ix)=Y^WUx)^p^^ (2) 
where G is the survival function of the rv C and 



W,Jx) 



are the Watson-Nadaraya weights, is a probability density function (pdf ) 
defined on IR*^ and hn a sequence of positive numbers converging to as n 
goes to infinity. Then ([2]) can be written 

where 

1 ^ SiTi _ fx-Xi\ , _ , I fx-Xi 

d 



ri n(x) = ——7 -^^ J, , ( — ; -\ and in(x) = ——r Kfj 

' ^ ' nhi ^ G{ Ti) ''K K ) ^ ' nhi{r{ 



hn 

(3) 

In practice, G is usually unknown, we replace it by the corresponding Kaplan- 
Meier [27] estimator (KME) G„ defined by 

0, ifi>^(n)- 



The properties of the KME for dependent variables can be found in Cai 



[5l[7]. Then a feasible estimator of m(x) is given by: mn{x) = ^\'"('^\ where 



is an estimator of ri{x) and (defined in ^) an estimator of l{x). 

In what follows, we define the endpoints of F and Ghy rp = sup {y, F{y) > O}, 
tg = sup{y, G(y) > 0} and we assume that Tp < oo and G{tf) > (this 
implies Tp < tg)- 

For technical reasons (see Lemma 5.1), we assume that {Gi,i > 1} and 
{(Xi,Yi) , i > 1} are independent; furthermore this condition is plausible 
whenever the censoring is independent of the characteristics of the patient 
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under study. We point out that since Y can be a lifetime we can suppose it 
bounded. We put ||t|| = Yfj^^ \tj\ for t G M"'. 

In order to define the a-mixing property, we introduce the fohowing nota- 
tions. Denote by J^i{Z) the it— algebra generated by {Zj,i < j < k} . 

Definition 2.1 Let {Zi,i = 1,2, ...} denote a sequence of rv's. Given a 
positive integer n, set 

a{n) = sup{|P(^nS) -F(A)P(5)| : A G F\{Z) and B G A; G IN* 

The sequence is said to be a-mixing (strong mixing) if the mixing coefficient 
a{n) as n — > oo. 

There exists many processes fulfilling the strong mixing property. We quote, 
here, the usual ARMA processes which are geometrically strongly mixing, 
i.e., there exist p G (0, 1) and a > such that, for any n > 1, a{n) < ap^ 
(see, e.g., Jones [26]). The threshold models, the EXPAR models (see, Ozaki 
[38] ) , the simple ARCH models (see Engle [20] ) , their GARCH extension (see 
Bollerslev [3]) and the bilinear Markovian models are geometrically strongly 
mixing under some general ergodicity conditions. 

We suppose that the sequences {Yi, i > 1} and {Ci,i > 1} are a-mixing with 
coefficients ai(n) and a2{n), respectively. Cai ([7], Lemma 2) showed that 
{Ti,i > 1} is then strongly mixing, with coefficient 

a(n) = 4 max(ai(n), a2(n)). 

From now on, we suppose that {{Ti,5i,Xi) i = 1, n} is strongly mixing. 
Now we are in position to give our assumptions and main result. 

3. Assumptions and main result 

Let C be a compact set of IR'^ which is included in Co = |x G IR"^ | £(x) > o| . 
We will make use of the following assumptions gathered here for easy refer- 
ence: 

Al) The bandwidth /i„ satisfies: lim„^+oo nW^ = +oo and lim„^+oo ^^S ^^S 
where < p < d. 

A2) The kernel is bounded and satisfies: 

i) J-^d \\t\\ Kd{t)dt < +00, 

ii) f-^d{ti +t2 + ... + td)Kj{t)dt < +00 and J^d Kj{t)dt < +oo, 
Hi) V(t, s) G C2 \Kd{t) - Kd(s)\ < \\t - sf for 7 > 0. 
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A3) The mixing coefficient a is such that a{n) = 0{n ^) for some v > 
p + v^p^ + 3{d — 1) where p = "'^'^'^'^'^'^ . 

A4) The function defined in ((T|) is continuously differentiable. 

A5) The function r2(x) := / fxY{x,y)dy is continuously differen- 

tiable. 

A6) 3L) > such that supjj^^g,;; \lij{u,v) — i{u)i{y)\ < D where iij is the 

joint distribution of {Xi,Xj). 

AT) 39 > 0,3 ci > 0,3 C2 > 0, such that 

A8) The marginal density ^(.) is continuously differentiable and there exists 
^ > such that l{x) > i Vx G C. 



Remark 3.1 Assumption Al is very common in functional estimation both 
in independent and dependent cases. However, it must be reinforced by As- 
sumptions A3 and A7 which ensure a practical calculus of the covariance's 
terms and the convergence of the series which appear in proof of Lemma 3. 
Assumptions A2, A4, A5 and A6 are needed in the study of the bias term 
of ri^n{x) which is the kernel estimator of ri{x). We point out that we 
do not require for to be symmetric as in Guessoum and Ould-Said J23^ . 
Assumption A8 intervenes in the convergence of the kernel density. Finally, 
the boundeness of Y is made only for the simplification of the proof. It can 
be dropped while using truncation methods as in Laib and Ould-Said fWjj. 

In the sequel letter C denotes any generic constant. 

Our main result is given in the following theorem which concerns the rate 
of the almost sure uniform convergence of the regression function. 



Theorem 3.1 Under Assumptions A1-A8, we have 




sup |m„(x) — m(x)| = O max \ ^/ , h^ \ ) a.s as n ^ 00. 

x&C \ 

Remark 3.2 The rate obtained here is slightly different from that obtained 

by Guessoum and Ould-Said J2^ in the independent case, which is O (max | \l '°rafe°f ^' 

for d = 1. Their result can easily be generalized for higher dimensional co- 

variate, ie X G IR'^, by adapting their Assumptions Al and A2 to obtain 

the rate O [max {,J^^, hi}). 
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Remark 3.3 // we choose hn = O (^{logn/n)^^'^^^^ then Theorem 1 be- 
comes: 



sup |m„(x) — m{x)\ = O ) a.s. 




This is the optimal rate obtained by Liebscher 1341 in the uncensored case. 



4. Simulations Study 

First, we consider the strong mixing bidimentionnal process generated by: 



where < p < 1, (ej)i is a white noise with standard Gaussian distribution 
and Xq is a standard Gaussian rv independent of (ej)j. We also simulate 
n iid rv Cj exponentially distributed with parameter A = 1.5. It is clear 
that the process (X„,1^,C„) is stationary and strongly mixing, in fact the 
process is an AR(1) and given Xi = x, we have Yi = px -\- y/l — p'^€2, 
then, Yi N{px,l — p^). In all cases we took p = 0.9. We calculate our 
estimator based on the observed data {XiTi ,5i,) i = 1, n, by choosing a 
Gaussian kernel K. In this case, we have m(x)= IE (Yi \Xi = x) = px. In 
all cases we took /i„ satisfying Al and A7, that is hn = O ({logn/n)^^j . 
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-15 



D.5 1 -2 -1.5 




Fig 1. m(x) — px, with n — 50, 100 and 300, respectively. 



We notice that the quahty of fit increases with n (see Figure 1). 



We also consider two nonhnear cases 



sm(—X,} 



smus case, 



(5) 



Yi 



12 



-X 



parabolic case. 



(6) 



Then we have m{x)= sin(^x) for ([5]) and m(a;) = ^/3^x^ + ^(1 — p'^) — 2 
for (ED. 



imsart-generic ver. 2008/01/24 file: ejs_2008_195.tex date: February 20, 2008 



Z. Guessoum and E. Ould-Said / Regression for censored and a-mixing condition 9 




Fig 2. m{x) — sin ^x, with n = 50, 100 and 300, respectively. 





Fig 3. m{x) = -^P^ + ^(1 - P ) ~ 2, with n = 50, 100 and 300, respectively. 



Figures 2 and 3 show that the quahty of fit for the non hnear model is as 
good as in the hnear modeh 

5. Proofs 



We split the proof of the Theorem 13.11 in the following Lemmata. 
Lemma 5.1 Under Assumptions Al, A2 i) and A4, for n large enough: 
sup |IE (fi^„(x)) — ri(x)| = 0(/i„) a.s. n^+oo. 



Proof of Lemma I5.lt Observe that 



lEr-^^^^ 



VG(Ti 



iXi =u 



IE 
IE 



IE 



%i<Ci}^l |y 



Xi = n 



Yi 



G{Yi 



-IE 



I{>i<Ci}l^l 



Xi = u 
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]E[yi|Xi = u] 

m{u). 



Then, we have from ([3]) 
IE(fi,n(x)) - ri(x) = 



K cm 



x-Xi 



x-X 



IR 



1 



h 

X — U 



hr, 



IE 



ri{x) 



m{u)i{u)du — ri{x 



ri(x) 



Kd{t) [ri{x-hnt)-ri{x)]dt 



since ri = mi. 

A Taylor expansion gives 

ri{x - hnt) - ri{x) = -hnih- — (x') + ... + t^- — (x')) 

OXi OXd 

where x' is between x — h„t and x. Then 



sup |IE(fi,„(x)) - ri(x) 



sup 

x&C 



Kd{t) [riix-hnt)-n{x)]dt 



< hn sup / 



Then Assumptions Al, A2 i) and A4 , give the result. ■ 
Now, we introduce the following lemma (see Ferraty and Vieu, ^Ij Propo- 
sition A.ll ii), p. 237). 

Lemma 5.2 Let {Ui,i £ IN} be a sequence of real random variables, with 
strong mixing coefficient a{n) = 0{n~'^), u > 1, such that Vn G ]N,Vi G 
IN, 1 < i < n < +00. Then for each e > and for each q > 1 : 



P 



i=l 



>e <C 1+ 



+ nCq 



-1 f^Q 



where 5^ = J2iJ2j \cov{Ui, Uj)\ . 

Lemma 5.3 Under Assumptions A1-A7, we have 



sup |fi,„(x) - IEfi,„(x))| = O 

x€C 



'log n 
nhi 



a.s as n ^ oo. 
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Proof of Lemma 15. 3t 



C is a compact set, then it admits a covering 5 by a finite number u„ of 
balls Bk{xl, a^) centered at xl = (x* j^, ...x^ j^), k € {1, Then for all 
X ^ C there exists k S {1, ...,t;„} such that \\x — < a'J^, where a„ verifies 

o^"^ = hn"'~^^^ n~2 , (7 is the same as in Assumption A2 iii)). Since C is 
bounded there exists a constant M > such that < t"- 
Now we set, for x G C; 

It is obvious that 



n 



1 JiTi fx-Xi 



^Aj(x) = fi,„(x) -IEfi,„(x)). 



i=l 



Writing Aj(x)— Aj(x^) = Aj(x), we have clearly |Aj(x)| < 
Then, 



sup 



From Assumption A2 iii) 

(m\yi\) 1 



X - Xj 

K 

x-Xi 



Kd 



+ |Ai(4)l- 



xl-X, 



xl-Xr 



sup 



i=l 



X — XI 



hn 



< 



- '^i^cK Girp) hi 



hn 



nm ai^ 



G{tf) hl+^ 



n 



hi 



Assumption Al implies that sup^.gc YA=i'^i{^) — 0( ) 



On the other hand, let f/j = nhilS.i{x*^. In order to apply Lemma [521 we 
have to calculate St. It is clear that 
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We have 



Var{Ui) = IE 



(52 (Ti 



2 / X* - Xi 



SiTi 
G{Ti 



4-^1 

hr. 



Using the conditional expectation properties and a change of variables, we 
get 



IE 



= IE 



G2(Ti) 

X 



^2 



< 



ht 



G{tf) Jm'' 



Kl{t)r2{xl-Kt)dt. 



By a Taylor expansion around x^, under Assumptions A2 ii) and A5, we 
obtain 



From Assumption A4, 



/2 = IE2 



hr, 



IE 



h = 0{hi). 



Xi, - u 



hr, 



ri{u)dt 



Finally Var{Ui) = 0{hi). 

Now let = '^■^ j^^- \cov{Ui, Uj)\, a direct calculus of \cov{Ui, Uj)\ gives 



\cov(Ui,Uj)\ = \JEUiUj)\ = 



E 



G{Ti 

^j'^j ^*^~ '^^ 



Kd 



hn 



IE 



f 6,T, 

Km: 



Kd 



xl-X, 



hn 



< 



GiTj 
(- 



hn 



TEli^Kd 



X ^ \ 



hn J G{Yj) 



IE 



Y, 



-K 



\GiYi) "V hn 



xl-X, 



Kd 



Y, 



hn 



W.\^^Kd 



G{Yj) "V hn 



<Ch 



2d 



Assumption A6 gives 



Kd{z)Kd{t) [iij {xl - zhn, xl - thn) - iiixl - zhn)ijixl - thn)] dz dt. 
\cov{Ui,Uj)\ = 0{hf). (7) 
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On the other hand, from a result in Bosq ([4J, p. 22), we have 

\cov{Ui,Uj)\<Ca{\i-j\). (8) 

Then to evaluate S*^* the idea is to introduce a sequence of integers Wn which 
we precise below. Then we use ([7]) for the close i and j and ([8]) otherwise. 
That is 

hf + Ca{\i-j\) 



< ^EE 

< 



i-j\>w„'^ ^0<\i-j\<w„ 

C (nhl^w-n^ + Cn^a (wn) ■ 



+1, we have S^* < 0{nhi) + Cn'^a (^) . Assump- 
tion A3 and the right part of Assumption A7 yield Tn?a{^) = 0{nh'^) . 



Now choosing tD, 
tio 
So 



1 



Si* = 0{nhi). 

Finally, we have 

Sl = Sl* + nVar{U^) = 0{nhi). 
Then, for e > 0, applying Lemma 15.21 we have 



P 



E^*(^ 

i=l 



> e 



IP 



E^^ 

i=l 



> nh'^e 



< C\l + C 



e^nhi 



+ nCq 



-1 



enh^ 



(.9) 



If we replace e by Eq^ 

n 

E^^(^*') 



/logn 



n 



hi 



for all eo > in dll), we get 



P 



1=1 



'logn 



hi 



n. 



By choosing q = (logn)^^^ (6 > 0), (fTO]) becomes 



<C 1 + \nCq'^ 



^eo^nh^ logn^ 
(10) 



P 



i=l 



> £01 



'logn 



n. 



hi 



< + nCq-^ [—] (nh^ log n 



2 



< + Ce-(^+i) (logn)^(i+^) n^-'^hn^. 
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Now we can write 



P < max 

k = l,...,Vn 



i=l 



> ^01 



/logn 



n 



hi 



< 



i=l 



1=1 



> 



/log n 



n 



hi 



< CMhn 



'122-1 

(log n) ^ ' n ^ ^ ' hn 



v+l d 



We have from the left part of assumption A7 



(11) 



J2 < C(logn)"(^+^)n'"'^ 



27 n 



,70^+10 + 27+1 X 
^ 27 > 



< C(logn) 



7(,. + l) + 27 + l-^ 
27 •' 



Then, for an appropriate choice of 0, J2 is the general term of a convergent 
series. In the same way, Ji < n^~'~'^o and we can choose eg such that Ji is the 
general term of convergent series. Finally, applying Borel-Cantelli lemma, to 
(fTTj) gives the result. ■ 



Remark 5.1 We point out that the parameter 6 of Assumption A7 can he 
chosen such as: 



1 



< < 



1 



7 (3 - I/) 



7(z^ + l) + 27 + l 1-u d[7(i/ + 1) + 27 + 1]' 

This condition ensures the convergence of the series of Lemma 3. 
Lemma 5.4 Under Assumptions A 1- A3 and A6-A8, 



sup \inix) — (^ix)\ = O ^max | ^ 
Proof of Lemma 15. 4t We have 



/logn 
nhi 



7 hji 



a.s 



as n — > 00. 



sup \l{x) - ln{x)\ < sup - IE [ln{x))\ + SUp |IE - t{x)\ . 

x&c xgc xec 

By Assumptions A1-A3, A6-A8, by an analogous proof to that of Lemma 
without censoring ( that is G{Ti) = 1, 6i = 1 and 1^ = 1) and putting 
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^ = ^o^/lf weget 



sup|£„(rE)-IE(4(x))| = 



/logn 



n 



hi / ■ 



(12) 



Furthermore, under A2 i) and A8 and using a Taylor expansion, we get 
sup|IE(4(x))-^(x)| =0(/i„) 

x&C 

which permit us to conclude. 



Lemma 5.5 Under Assumptions A1-A3, A6-A8, we have 

sup \ri^n{x) - fi^n{x)\ = o ( J— a.s as n ^ oo. 



Proof of Lemma 15. 5t We have from ([3]) and @ 



\fl,n {x) - fi,n{x)\ 



nhi 



E 
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In the same way as for Theorem 2 of Cai (2001), it can be shown under A3 
that 

I — — I / / log log 11 \ 

sup (\Gn(t) — G(t)\) = O \ \ a.s. Furthermore, from the defini- 

t<rF \^ n J 

tion of inix), Lemma l5.41 Assumptions Al, A2 and A8, and the fact that 
Y is bounded we get the result. ■ 
Proof of Theorem I3.lt We have 



sup |mn(x) — m(a;)| < sup 



ri,n{x) fi^n{x) 



+ 



^ni,x) in{x) 

IE(n,n(a;)) ri{x) 



+ 



fi,„(x) lEfi,„(x)) 



^nix) ^n{x) 



+ 



^n(,x) ^nix) 

ri{x) ri{x) 



tn{x) t{x) 



< 



■ r„ , . { sup \ri,n{x) - n,„(x)| + sup |ri,„(x) - Efi „(x))| 



+ sup|IE(fi,„(a;) -ri(x))| +sup(|ri(2;)|^^i) su^\i{x) - ln{x)\ \ . (13) 
x&c xec ^ ' x<^c J 
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The kernel estimator ln{x) is almost surely bounded away from because 
of Lemma 15.41 and the fact that the second part of Assumption A8. 
Then, ([13|) in conjunction with Lemmas 15. H 15.31 15.41 and 15.51 we conclude 
the proof. ■ 
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